CHAPTER 16 Getting Straight Talk on Straight-Line Regression 215
You should proceed with straight-line regression when one or more of the follow-
ing are true:»
» You want to test whether there’s a statistically significant association between
the X and Y variables.»
» You want to know the value of the slope and/or intercept (also referred to as
the Y intercept) of a line fitted through the X and Y data points.»
» You want to be able to predict the value of Y if you know the value of X.
Understanding the Basics of
Straight-Line Regression
The formula of a straight line can be written like this: Y
a
bX . This formula
breaks down this way:»
» Y is the dependent variable (or outcome).»
» X is the independent variable (or predictor).»
» a is the intercept, which is the value of Y when X
0.»
» b is the slope, which is the amount Y changes when X increases by 1.
In straight-line regression, our goal is to develop the best-fitting line for our data.
Using least-squares as a guide, the best-fitting line through a set of data is the
one that minimizes the sum of the squares (SSQ) of the residuals. Residuals are the
vertical distances of each point from the fitted line, as shown in Figure 16-2.
FIGURE 16-2:
On average, a
good-fitting line
has smaller
residuals than a
bad-fitting line.
© John Wiley & Sons, Inc.